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Dear Sir: 



1. I, Filippo M. Randazzo, declare and say I am a resident of the State of California My 

residence address is 104 Capricorn Avenue, Oakland, CA 9461 1. 



2. I hold a B.S. degree in Molecular Microbiology and Anthropology, which I received fiom 
the University of Notre Dame in 1985. 1 further hold a Ph.D. degree, which I received fix)m 
Indiana University in 1991. I am skilled in the fields of genetics, molecular biology, 
developmental biology genomics and cancer biology. I am a co-inventor of the invention 
claimed in the above-referenced patent application. 

3. I, George F. Lamson, declare and say I am a resident of the State of California. My 
residence address is 232 Sandringham Dr., Moraga, CA 94556. 

4. I hold a BS degree in Biochemistry, which I received from the University of CA, Santa 
Barbara in 1976. 1 further hold a PhD degree, which I received from University of CA, Berkeley, 
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in 1982. 1 am skilled in the fields of Bioinformatics. I am a co-inventor of the invention claimed 
in the above-referenced patent application, 

5. I have reviewed the relevant portions of the Office Action (specifically section nos. 6 
and 10), mailed October 2, 2001, in the above-referenced application. I understand that claims 
58, 60-66, 116, and 126 of the above-referenced patent application are rejected under 35 U.S.C. 
§101 on the grounds that the claimed invention lacks patentable utility, and also under 35 U.S.C. 
§ 1 1 2, If 1 , on the grounds that since the claimed invention is not supported by a patentable utility, 
one skilled in (he art would not know how to use the claimed invention. 

6. This Declaration provides fiirther evidence of the patentable utility of the claimed 
invention. Specifically, this Declaration provides evidence that the nucleotide sequence 
designated SEQ ID NO: 1 186 represents genes that are differentially expressed in cancer cells, 
thus supporting the assertion that the claimed invention has utility in detecting cancer cells. 

7. The following experiments were conducted by me or under my direction. 

8. Genes differentially expressed in cancerous cells were identified as detected by 
microarray hybridization analysis using materials obtained firom patient colon tissue samples. 
The biological materials used in these experiments, the methods of analysis, and the results are 
described below. 

9. Source of patient tissue samples. Nomial and cancerous tissues were collected firom patients 
using laser capture microdissection (LCM) techniques, which techniques are well known in the 
art. Table 1 (Attachment 1) provides information about each patient firom which the samples 
were isolated, including: the Patient ID ("PT ID") and Path ReportID ("Path ID"), which are 
numbers assigned to the patient and the pathology reports for identification purposes; the group 
("Grp") to which the patients have been assigned; the anatomical location of the tumor ("Anatom 
Loc**); the primary tumor size ("Size"); the primary tumor grade ("Grade"); the identification of 
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the histopathological grade ("Histo Grade"); a description of local sites to which the tumor had 
invaded ("Local Invasion"); the presence of lymph node metastases ("Lymph Met"); the 
incidence of lymph node metastases (provided as a number of lymph nodes positive for 
metastasis over the number of lymph nodes examined) ("Lymph Met Incid"); the regional 
lymphnode grade ("Reg Lymph Grade"); the identification or detection of metastases to sites 
distant to the tumor and their location ("Dist Met & Loc"); the grade of distant metastasis ("Dist 
Met Grade"); and general comments about the patient or the tumor ("Comments"). 
Histopathology of all primary tumors indicated the tumor was adenocarcinoma except for Patient 
ID Nos. 130 (for which no infonmation was provided), 392 (in which greater than 50% of the 
cells were mucinous carcinoma), and 784 (adenosquamous carcinoma). Extranodal extensions 
were described in three patients. Patient ID Nos. 784, 789, and 791 . Lymphovascular invasion 
was described in Patient ID Nos. 128, 228, 278, 517, 534, 784, 786, 789, 791, 890, and 892. 
Crohn's-like infiltrates were described in seven patients. Patient ID Nos. 52, 264, 268, 392, 393, 
784, and 791. 

10. Source of polynucleotides on arrays. Polynucleotides spotted on the arrays were 
generated by PGR amplification of clones derived fi-om cDNA libraries. The clones xised for 
amplification were either the clones fi"om which the sequences described herein were derived, or 
are clones having inserts with significant polynucleotide sequence overlap with the sequences 
described herein as determined by BLAST2 homology searching. 

11. Microarray Design. Each array used in the examples below had an identical spatial 
layout and control spot set. Each microarray was divided into two areas, each area having an 
array with, on each half, twelve groupings of 32 x 12 spots for a total of about 9,216 spots on 
each array. The two areas are spotted identically which provide for at least two duplicates of each 
clone per array. Spotting was accompUshed using PGR amplified products fi-om 0.5kb to 2.0 kb 
and spotted using a Molecular Dynamics Gen DI spotter according to the manufacturer's 
reconmiendations. The first row of each of the 24 regions on the array had about 32 control 
spots, including 4 negative control spots and 8 test polynucleotides. The test polynucleotides 
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were spiked into each sample before the labeling reaction with a range of concentrations from 2- 
600 pg/slide and ratios of 1 : 1 . For each array design, two slides were hybridized with the test 
samples reverse-labeled in the labeling reaction. This provided for about 4 duplicate 
measurements for each clone, two of one color and two of the other, for each sample. 

12. Microarray Analysis. cDNA probes were prepared from total RNA isolated from the 
patient cells described in Table 1 (Attachment 1). Since LCM provides for the isolation of 
specific cell types to provide a substantially homogenous cell sample, this provided for a 
similarly pure RNA sample. Total RNA was first reverse transcribed into cDNA using a primer 
containing a T7 RNA polymerase promoter, followed by second strand DNA synthesis. cDNA 
was then transcribed in vitro to produce antisense RNA using the T7 promoter-mediated 
expression, and the antisense RNA was then converted into cDNA. The second set of cDNAs 
were again transcribed m vitro, using the T7 promoter, to provide antisense RNA. Optionally, the 
RNA was again converted into cDNA, allowing for up to a third round of T7-mediated 
amplification to produce more antisense RNA. Thus the procedure provided for two or three 
rounds of in vitro transcription to produce the final RNA used for fluorescent labeling. 
Fluorescent probes were generated by first adding control RNA to the antisense RNA mix, and 
producing fluorescently labeled cDNA from the RNA starting material. Fluorescently labeled 
cDNAs prepared &om the tumor RNA sample were compared to fluorescently labeled cDNAs 
prepared from normal cell RNA sample. For example, the cDNA probes from the normal cells 
were labeled with Cy3 fluorescent dye (green) and the cDNA probes prepared bom the tumor 
cells were labeled with Cy5 fluorescent dye (red). 

1 3 . The differential expression assay was perfomied by mixing equal amounts of probes from 
tumor cells and normal cells of the same patient ("matched") or from tumor cells and normal 
cells of different patients CWnatched") {i.e., the tumor cells are from one patient and the normal 
cells are from a different patient). The arrays were prehybridized by incubation for about 2 hrs at 
60°C in 5X SSC/0.2% SDS/1 mM EDTA, and then washed three times in water and twice in 
isopropanol. Following prehybridization of the array, the probe mixture was then hybridized to 
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the array under conditions of high stringency (overnight at 42°C in 50% formamide, 5X SSC, 
and 0.2% SDS. After hybridization, the array was washed at 55°C three times as follows: 1) first 
wash in IX SSC/0.2% SDS; 2) second wash in O.IX SSC/0.2% SDS; and 3) third wash in O.IX 
SSC. The arrays were then scanned for green and red fluorescence using a Molecular Dynamics 
Generation in dual color laser-scanner/detector. The images were processed using BioDiscovery 
Autogene software, and the data from each scan set normalized to provide for a ratio of 
expression relative to normal. Data from the microarray experiments was analyzed according to 
the algorithms described in U.S. ^plication serial no. 60/252,358, filed November 20, 2000, by 
E.J. Moler, M.A. Boyle, and F.M. Randazzo, and entitled "Precision and accuracy in cDNA 
microarray data," The experiment was repeated, this time labeling the two probes with the 
opposite color in order to perform the assay in both "color directions." Each experiment was 
sometimes repeated with two more slides (one in each color direction). The level fluorescence 
for each sequence on the array expressed as a ratio of the geometric mean of 8 replicate 
spots/genes from the four arrays or 4 replicate spots/gene from 2 arrays or some other 
permutation. The data were normalized using the spiked positive controls present in each 
duplicated area, and the precision of this nonnalization was included in the final determination of 
the significance of each differential. The fluorescent intensity of each spot was also compared to 
the negative controls in each duplicated area to determine which spots detected significant 
expression levels in each sample. 

14. A statistical analysis of the fluorescent uitensities was jqjplied to each set of duplicate 
spots to assess the precision and significance of each differential measurement, resulting in ap- 
value testing the null hypothesis that there is no differential in the expression level between the 
tumor and normal samples of each patient. For initial analysis of the microarrays, the hypothesis 
was accepted if p>10"\ and the differential ratio was set to 1 .000 for those spots. All other spots 
have a significant difference in expression between the tumor and normal sample. If the tumor 
sample has detectable expression and the normal does not, the ratio is truncated at 1 000 since the 
value for expression in the normal sample would be zero, and the ratio would not be a 
mathematically usefiil value (e.g., infinity). If the normal sample has detectable expression and 
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the tumor does not, the ratio is truncated to 0.001, since the value for expression in the tumor 
sample would be zero and the ratio would not be a mathematically useful value. These latter two 
situations are referred to herein as "on/ofF." Database tables were populated using a 95% 
confidence level (p>0.05). 

15. In general, a polynucleotide is said to represent a significantly differentially expressed 
gene between two samples when there is detectable levels of expression in at least one sample 
and the ratio value is greater than at least about 1 .2 fold, preferably greater than at least about 1 .5 
fold , more preferably greater than at least about 2 fold, where the ratio value is calculated using 
the method described above. A differential expression ratio of 1 indicates that the expression 
level of the gene in the tumor cell was not statistically different &om expression of that gene in 
normal colon cells of the same patient. A differential expression ratio significantly greater than 1 
in cancerous colon cells relative to normal colon cells indicates that the gene is increased in 
expression in cancerous cells relative to normal cells, indicating that the gene plays a role in the 
development of the cancerous phenotype, and may be involved in promoting metastasis of the 
cell. 

1 6. Table 2, shown below, summarizes the results of the differential expression analysis in 
colon tissue. The table provides: (1) the SEQ ID NO of the polynucleotide corresponding to the 
polynucleotide on the spot on the array; (2) the Clone Name; (3) whether the tumor cells and 
normal cells were Scorn the same patient ("matched) or were bom different patients 
('^unmatched"); (4) the number of patients tested ("No. Tested"); (5) the percentage of patients 
tested in which detected expression level was greater than or equal to a 2-fold increase in 
cancerous tissue relative to normal tissue (">=2x"); (6) the percentage of patients tested in which 
detected expression level was greater than or equal to a 5-fold increase in cancerous tissue 
relative to normal tissue (">=5x"); and (7) the percentage of patients tested in which detected 
expression level was less than or equal to 14 of the expression level in normal cells ("<=0.5x"). 
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TABLE 2 



SEQ ID 
NO: 


Clone Name 


Matched/ 
Unmatched 


No. 
Tested 


>=2x 
95% conf. 


>=5x 
95% conf. 


<=0.5x 
95% conf. 


1186 


M00032614C:B10 


matched 


40 


67.5 


12.5 


48.48 


1186 


M00032614C:B10 


unmatched 


28 


42.9 


0.0 


3.6 



1 7. The data above support the assertion that a polynucleotide having a sequence of SEQ K) 
NO: 1 186 represents genes that are differentially expressed in cancer cells, thus supporting the 
assertion that the claimed invention has utility in detecting cancer cells. Specifically, detection 
of gene products that correspond to genes having a sequence of SEQ ID NO: 1 1 86 can provide 
an indicator that the cell is cancerous, and may provide a ther^eutic and/or diagnostic target. 
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18. I, Filippo M. Randazzo, hereby declare that all statements made herein of my own 

knowledge are true and that all statements made on information and belief are believed to be 
true; and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, under Section 
1 001 of Title XVin of the United States Code, and that such will false statements may jeopardize 
the validity of the application or any patent issuing thereon. 



Date Filippo M. Randazzo 



19. I, George Lamson, hereby declare that all statements made herein of my own knowledge 
are true and that all statements made on information and belief are believed to be true; and 
further that these statements were made with the knowledge that willful false statements and the 
like so made are punishable by fine or imprisonment, or both, under Section 1 001 of Title XVin 
of the United States Code, and that such will false statements may jeopardize the validity of the 
apphcation or any patent issuing thereon. 

Date George Camson 

Attachments: Table 1 of patient data 

F:\DOCUMENT\2300\1481\DeciratiGn of RandazzoUmson for SEQ ID NO 1 186xloc 
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O 
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U9 



9J 



9t 
U 

o 



'a 



cd 
C 

2 S 
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o 



CO 



2a 



00 



8 



o OS 
< P 



Comment 


moderately to 
poorly 

differentiated 
primary tumor 








moderately to 
I poorly 
differentiated 


j moderately to 
poorly 

differentiated 

metastatic 

tumor 
















Dist 
Met 
Grade 


Ml 


Ml 


Ml 


Ml 


Ml 


Ml 


Ml 


Ml 


Ml 


Ml 


Ml 




Ml 


Dist Met 
& Loc 


Pos-liver 


Pos-liver. 
diaphrag 

m, 
gallbladd 
er nodule 


Pos-liver, 
peritoneu 
m 


Pos-liver 1 


Pos-liver 


Pos-liver 


Pos-liver 


Pos-liver 


Pos-liver 


Pos-liver 


Pos-liver 


Pos-liver 


Pos-liver 


Reg 
Lymph 
Grade 


IN 


























Lymph. 
Met 
Incid 


2/11 


























Lymph 
Met 


Pos 


Neg 


Neg 


1 Neg 1 


Neg 


Neg 


1 Neg 1 


Pos 




Neg 


Neg 


Neg 




Local Invasion 


through 

muscularis propria 
into perirectal soft 
tissue 


























Histo 
Grade 


G3 










G3 


ZO 










G2 




Grade 


m 
H 




























q 


























Anatom 
Loc 


Descending 
colon 


























Grp 


AI 










AI 






AI 








^ 
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00 
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m 

00 


rs 
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oo 


ro 

CO 
00 


oo 


»n 
m 

00 


VO 
m 

00 


m 

00 



9 



O OS 

CO 



Comment 


well to 
moderately 
differentiated 
adenocarcinom 
as; this patient 
has tumors of 
the ascending 
colon and the 
sigmoid colon 


moderately 
differentiated 
adenocarcinom 
a 




Perineural 

invasion 

present. 


Perineural 

invasion 

present, 

extensive. 

Patient with a 

history of colon 

cancer. 


Dist 
Met 
Grade 


i 


S 


i 






Dist Met 
&Loc 


Pos- 
Liver 


Pos- 
Liver 


Pos- 
Livcr 


Pos- 
Liver 


Pos- 
Liver, left 
and right 

lobe, 
omentum 


Reg 
Lymph 
Grade 




2 


s 




2 


Lymph. 
Met 
Incid 


3/21 




11/15 


4/15 


1/28 


Lymph 
Met 


CO 


CO 


CO 


CO 

o 


CO 


Local Invasion 


Into muscularis 
propria 


Through 

muscularis propria 
int subserosal 
tissue 


Through 

muscularis propria 
into subserosa. 


Invasion through 
muscularis propria 
into perirectal soft 
tissue 


Invasion into 
pericolic sort 
tissue. Tumor 
focally invading 
skeletal muscle 
attached to colon. 


Histo 
Grade 




O 


O 


O 


O 


Grade 


p 


m 
H 


m 
H 




H 




O 


00 






p 


Anatom 
Loc 


Ascending 
colon . 


Cecum 


Ascending 
colon 


Rectum 


Sigmoid 


Grp 














00 
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o 
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o 
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00 
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00 
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o 

OS 

00 


Ov 
OO 


fS 
Ov 
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10 



o o\ 



s 

o 

08 



Comment 


Perineural 
invasion focally 
present. 
Omentum 
mass, but 
resection with 
no tumor 
identified. 


Dist 
Met 
Grade 


Ml 


<^ 

?^ 


Pos- 
Liver 


Reg 
Lymph 
Grade 


N2 


Lympb 
Met 
Incid 


14/17 


Lymph 
Met 


Pos 


Local Invasion 


Through 

muscularis propria 
into pericolic fat 


Histo 
Grade 


G2-G3 


Grade 


T3 




0 

NO 


Anatom 
Loc 


Transverse 
colon 


Grp 






OS 


PtID 


OS 
00 
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> NCBl 



Blast 2 Sequences results 

PubMed Entrez BLAST OMIM Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION TBLASTX 2.2.2 [Dec-14-20011 



Matchil^TI j Mis match:!^! 
x_dropoff: 1!° ^ 



ismatchil ^Hj ga p open:PV Jg^ extension: I- — ; 
Jexpect:llggwor<isize:pFiltern 



Sequence 



vd20e09.sl Knowles Solter mouse 2 cell Mus musculus 
Sequence gi cDNA clone IMAGE:793096 5' similar to TR:G555806 
2 2156932 G555806 4-HYDROXYPHENYLPYRUVATE ACID 

DIOXYGENASE. :. 



c 



Length 299 ^^^^'^ 



Length 415 



(1 

415) 



NOTE:The statistics (bitscore and expect value) is calculated based on the size of nr database 



Score = 104 bits (222), Expect = 2e-21 
Identities = 39/50 (78%) 

Frame = +1 / +3 
I I 



Query: 4 LTLACTPGSSPTLLRWFHDCLGFCHLPLSPGEDPELGLEMTAGFGLGGMR 153 

LTIiACT GSSP L+RWFHDCLGF HLPLSPGEDPE+GL++ AG G GG+R ( I 

Sbjct: 3 LTLACTSGSSPMLMRWFHDCLGFHHLPLSPGEDPEMGLKVAAGSGRGGLR 152 

Score = 93.2 bits (197), Expect = 6e-18 
Identities = 37/48 (77%) 
Frame = -1 / -1 

I I 



Query: 146 PPSPNPAVISRPSSGSSPGLSGKWQKPRQSWNQRKSVGELPGVQAKVR 3 

PP P+PA RP SGSSPGLSGKW KPRQSWNQR S+GELP VQAKV+ 
Sbjct: 145 PPRPDPAATLRPI SGSSPGLSGKW* KPRQSWNQRISIGELPEVQAKVK 2 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi75 
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Score = 84.5 bits (178), Expect = 3e-15 
Identities = 35/48 (72%) 
Frame = -3 / -3 




Query: 147 SPKPKPCCHFKAQLGIFTWAQRQVAKAQAWEPAQKCGGAAGGAGQGQ 4 

SP P+PC H +A L I TWAQRQV KA AWEPA K GAAGGAGQGQ /O 
Sbjct: 146 SPSPRPCSHLEAHLRILTWAQRQVVKA*AWEPAHKHRGAAGGAGQGQ 3 

Score = 81.7 bits (172), Expect = 2e-14 
Identities = 35/49 (71%) 
Frame = +2 / +1 



Query: 2 T*PWPAPPAAPPHFCAGSTTAWAFATCR*AQVKIPSWALK*QQGIiGLGE 148 

T*PWPAPPAAP CAGSTTA AF TCR*AQV+I WA + QGLG G+ 
Sbjct: 1 T*PWPAPPAAPLCLCAGSTTA*AFTTCR*AQVRIRRWASRWLQGIiGEGD 147 

Score = 73.9 bits (155), Expect = 4e-12 
Identities = 33/47 (70%) 
Frame = +3 / +2 

I ■ 

I I 



Query: 6 DLGLHPRQLPHTFALVPRLPGLLPLAAEPR*RSRAGP*NDSRVWAWG 146 

DLGLH RQLP+ +ALVPRLP L PLAAEPR* S GP RVWA G f 
Sbjct: 5 DIiGLHLRQLPYAYAIiVPRLPRLSPLAAEPR*GSGDGPQGGCRVWARG 145 

Score = 70.3 bits (147), Expect = 5e-ll 
Identities = 33/47 (70%) 
Frame = -2 / -2 



Query: 145 PQAQTLLSFQGPARDLHLGSAASGKSPGSRGTSAKVWGSCRGCRPRS 5 

P AQTL GP+ D HLGSAASG+S GSRGTSA GSCR CRPRS V 
Sbjct: 144 PLAQTLQPP*GPSPDPHLGSAASGESLGSRGTSA*A*GSCRRCRPRS 4 

CPU time: 0.07 user sees. 0.03 sys. sees 0.10 total sees. 

Lambda K H 

0.318 0.135 0.401 



Matrix: BLOSUM62 

Number of Hits to DB: 623 

Number of Sequences: 0 

Number of extensions: 18 

Number of successful extensions: 6 

Number of sequences better than 10.0: 1 

length of database: 1,668,972,645 

effective HSP length: 54 

effective length of database: 1,367,552,853 
effective search space used: 61539878385 
frameshift window, decay const: 50, 0.1 
T: 13 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi?5 
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A: 40 

XI: 16 ( 7.3 bits) 
X2: 109 (49.9 bits) 
SI: 41 (21.7 bits) 
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NCB 



I 



Blast 2 Sequences results 

PubMed Entrez BLAST OM!M Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION TBLASTX 2.2.2 [Dec-14-20011 



Match:!? J Mismatch:!? ! gap open:!!! \ gap extension; 



x_dropofF: 1 expect 



l^wordsize:!!!!^ Mil 



Sequence j^jj^^^^ 



Length 300 ^QQj 

zd77c01.rl Soares_fetal_heart_NbHH19WHomo sapiens 
Sequence gi cDNA clone IMAGE:346656 5' similar to contains Alu , , (I .. 
2 1423522 repetitive element;contains element TARl repetitive ^ 446) 

element ;. 

2 



I L 




NOTE:The statistics (bitscore and expect value) is calculated based on the size of nr database 



Score = 70.7 bits (148), Expect (2) = 2e-18 
Identities = 28/29 (96%) 
Frame = -1 / -2 
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Query: 216 SAHRNLHLLKGF*WQNVSIiNAGPRITAPA 130 

SAHRNLHLLKGF*WQN+SLNAGPRITAPA 
Sbjct: 445 SAHRNLHLLKGF*WQNLSLNAGPRITAPA 359 

Score = 70.3 bits (147), Expect = 5e-ll 
Identities = 32/47 (68%) 
Frame = +1 / +2 



JT 



r 



Query: 76 RHEPPRIiAKGFNELMDVQCWGCYSRACIETHILPSKAF*EVEVAVS* 216 

RHE K CWGCYSRACIET ILPSKAF*EVEVAVS* 7 2- 

Sbjct: 305 RHEHHAWPKDLMN*WMYSCWGCYSRACIETQILPSKAF*EVEVAVS* 445 

Score =70.3 bits (147), Expect (2) = le-18 
Identities = 28/29 (96%) 
Frame = +2 / +3 



Query: 131 AGAVILGPALRLTFCHQKPFKRWRLR*AD 217 

AGAVILGPALRL FCHQKPFKRWRLR*AD / J 

Sbjct: 360 AGAVILGPALRLKFCHQKPFKRWRLR*AD 446 

Score =68.0 bits (142), Expect (2) = 2e-16 
Identities = 28/29 (96%) 
Frame = -3 / -1 

I — I 

I I 



Query: 217 VSSPQPPPLKRLLMAKCESQCRP*NNSPS 131 

VSSPQPPPLKRLLMAK ESQCRP*NNSPS |(^ 
Sbjct: 446 VSSPQPPPLKRLIiMAKFESQCRP*NNSPS 360 

Score =64.3 bits (134), Expect (2) = 2e-15 
Identities = 27/29 (93%) 
Frame = +3 / +1 



Query: 129 VLGLLF*GLH*DSHFAIKSLLRGGGCGEL 215 / 

+LGLLF*GLH*DS+FAIKSLLRGGGCGEL / S 

Sbjct: 358 LLGLLF*GLH*DSNFAIKSLLRGGGCGEL 444 

Score =57.9 bits (120), Expect (2) = le-12 
Identities = 27/28 (96%) 

Frame = -2 / -3 

1 I 

1 JL 



Query: 215 QLTATSTS*KAFDGKM*VSMQALE*QPQ 132 

QLTATSTS * KAFDGK+ * VSMQALE*QPQ 
Sbjct: 444 QLTATSTS *KAFDGKI*VSMQALE*QPQ 361 

Score = 45.0 bits (92), Expect(2) = le-18 
Identities = 15/15 (100%) 
Frame = +2 / +2 

I I 
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Query: 86 HHAWPKDLMN*WMYS 130 

HHAWPKDLMN*WMYS 

Sbjct: 314 HHAWPKDLMN*WMYS 358 

Score =44.1 bits (90), Expect (2) = 2e-18 
Identities = 16/18 (88%) 
Frame = -3 / -2 

( I 

I I 



Query: 139 SPSTVHPLIH*ILWPGW 86 

+P+TVHPLIH*ILWPGW 
Sbjct: 367 APATVHPLIH*ILWPGW 314 

Score =40.0 bits (81), Expect (2) = 2e-16 
Identities = 15/17 (88%) 

Frame = -2 / -1 

I I 

I 1 



Query: 137 PQHCTSINSLNPLARRG 87 

P +CTSINSIiNPLARRG 
Sbjct: 365 PSNCTSINSLNPLARRG 315 



Score = 40.0 bits (81), Expect (2) = 2e-15 
Identities = 17/23 (73%) 
Frame = +3 / +3 



Query: 84 ATTPGQRI**INGCTVLGLLF*G 152 

+TTPGQRI**INGCTV G + G 
Sbjct: 312 STTPGQRI**INGCTVAGAVILG 380 

Score =37.3 bits (75), Expect (2) = le-12 
Identities = 14/14 (100%) 

Frame = -1 / -3 
t ' 



Query: 129 LYIH*FIKSFGQAW 88 

LYIH*FIKSFGQAW ' 
Sbjct: 357 LYIH*FIKSFGQAW 316 

Score = 37.3 bits (75), Expect = 0.42 
Identities = 15/17 (88%) 
Frame = +1 / +1 



Query: 88 PRLAKGFNELMDVQCWG 138 

PRLAKGFNELMDVQ G i4 



Sbjct: 316 PRLAKGFNELMDVQLLG 366 
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CPU time: 



0.09 user sees. 



0.02 sys. sees 



0.11 total sees. 



Lambda 



K 



H 



0.318 0.135 0.401 



Matrix: BLOSUM62 

Number of Hits to DB: 581 

Number of Sequences: 0 

Number of extensions : 24 

Number of successful extensions: 12 

Number of sequences better than 10.0: 1 

length of database: 1,668,972,645 

effective HSP length: 54 

effective length of database: 1,368,557,577 
effective search space used: 61585090965 
frameshift window, decay const: 50, 0.1 
T: 13 
A: 40 

XI: 16 ( 7.3 bits) 
X2: 109 (49.9 bits) 
SI: 41 (21.7 bits) 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/>vblast2.cgi? 



3/18/2002 



•piast Result 



Exhibit 4 
SEQID739 



Page 1 of 4 



<^ NCBI 



Blast 2 Sequences results 



PubMed Entrez BLAST OMIM Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION TBLASTX 2.2.2 [Dec-14-2001] 

Match:!^ ~ Mismatch:!^" gap open:P^ gap extension: 

X dropoff: 1^ expect:P°^ wordsize: 1^ Fjitgr F" 



Sequence 



(1 



Length 300^00^ 



Sequence gi mel9dl0.rl Scares mouse embryo NbME13.5 14.5 Mus , . ^ (1 

2 1375806 musculus cDNA clone IMAGE:387955 5'. "^^^ 342) 




JO- 



NOTE :The statistics (bitscore and expect value) is calculated based on the size of nr database 

Score = 174 bits (375), Expect(2) = le-47 
Identities = 69/82 (84%) 
Frame = +2 / +2 



Query: 53 LRSLTRQLVQDEOTRGVITMNEEYETRFLCNSSQEWKRLGVEQLRLSTVDMTGIPTLDNL 232 

L+++TR+LV DENVRGVITMNEEYETRFLCN+S+EWK+ GVEQLRLSTVDMTG+ PTL NL » 
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Sbjct: 68 LKNOTRRLVLDENVRGVITMNEEYETRFLCNTSKEWKKAGVEQLRLSTVDMTGVPTI^ 247 

Query: 233 QKGVQFALKYQSLGQCVYVHCK 298 

KGVQFALKYQ+LGQCVYVHCK 
Sbjct: 248 HKGVQFALKYQALGQCVYVHCK 313 

Score = 166 bits (356), Expect (2) = 6e-42 
Identities = 67/79 (84%) 
Frame = -l / -l 

j ) 
I I 



Query: 300 ALQCT*THWPSDWYLRANWTPFWRLSKVGIPVMSTVLSRSCSTPSLFHSCEELHRNLVSY 121 

ALQCT*THWPS WYLRANWTP WRL+KVG PVMSTVLSRSCSTP+ FHS E LHRN VSY 
Sbjct: 315 AU3CT*THWPSAWYLRANWTPLWRLAKVGTPVMSTVLSRSCSTPAFFHSFEVLHRNRVSY 136 

Query: 120 SSFMVITPRTFSSCTSWRV 64 

SSF+VITPRTFSS TS RV 
Sbjct: 135 SSFIVITPRTFSSSTSRRV 79 

Score - 146 bits (313), Expect (2) = 4e-38 
Identities = 59/87 (67%) 
Frame = +1 / +1 

r- I 



Query: 40 GALRVAELDAPAGTGRERARGDHHERGVRDEVPVQLFTGVEETRSRXXXXQHSRHDWDPH 219 

G E DAPAGTGRERARGDH+ERGVRD +PVQ G+EE+RS QHSRHDW P+ 

Sbjct: 55 GRAAAEEHDAPAGTGRERARGDHYERGVRDPIPVQHLEGMEESRS*AATAQHSRHDWGPN 234 

Query: 220 LGQPPEGSPICSQVPVAGPVCLRAL*G 300 

LGQ P+ SP+CSQVP GPVCL AL*G 
Sbjct: 235 LGQSPQRSPVCSQVPGTGPVCLCAL*G 315 

Score = 90.4 bits (191), Expect = 4e-17 
Identities = 40/75 (53%) 
Frame = +3 / +3 



JT 



Query: 75 WYRTRTCAXXXXXXXXXXXXXCATLHRSGRD*ESXXXXSAQ*T*LGSPPWTTSRRESNLL 254 

WY TRTCA CAT R+GR E SAQ T*LGS PW S +ES+LL 

Sbjct: 90 WYWTRTCAG*SL*TRSTRPDSCATPRRNGRKQELSSYGSAQST*LGSQPWPISTKESSLL 269 



Query: 255 SSTSRWASVFTCIVR 299 

SST WASV CIVR 
Sbjct: 270 SSTRHWASVSMCIVR 314 

Score = 70.3 bits (147), Expect(3) = 5e-21 
Identities = 28/38 (73%) 
Frame = -3 / -3 

I m 

I ~i 



Query: 298 LTMHVNTLAQRLVLESKLDSLLEWQGGDPSHVYCAEX 185 
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LTMH++TLAQ LVLESKLDS +E+ QG DPSHV CAE 
Sbjct: 313 LTMHIDTLAQCLVLESKLDSFVEIGQGWDPSHVDCAEP 200 

Score = 60.2 bits (125), Expect(2) = 7e-09 
Identities = 33/55 (60%) 
Frame = -2 / -2 



Query: 299 PYNARKHTGPATGT*EQIGLPSGGCPRWGSQSXXXXXXXXXXXLVSSTPVKSCTG 135 

PYNA +HTGP GT*EQ GL G PR G QS L+SS P + CTG 

Sbjct: 314 PYNAHRHTGPVPGT*EQTGLLCGDWPRLGPQSCRLC*AVAAQLLLSSIPSRCCTG 150 

Score =47.8 bits (98), Expect (3) = 5e-21 
Identities = 10/24 (41%) 
Frame = -3 / -3 



Query: 145 VAQXXXXXXXXXXXXXAHVLVLYQ 74 

VAQ AHVLV YQ 

Sbjct: 160 VAQESGLVLLVHSDHPAHVLVQYQ 89 

Score =37.7 bits (76), Expect (2) = le-47 
Identities = 2/15 (13%) 
Frame = +1 / +2 



Query: 4 DWXXXXXXXXXXXXX 48 
DW 

Sbjct: 20 DWYHRIDHTVLLGAL 64 

Score = 34.1 bits (68), Expect (2) = 4e-38 
Identities = 2/19 (10%) 
Frame = +2 / +3 



Query: 5 TGXXXXXXXXXXXXXXXXX 61 
TG 

Sbjct: 21 TGTTASTTRFCWARCR*RT 77 

Score = 27.2 bits (53), Expect(2) = 6e-42 
Identities = 2/15 (13%) 

Frame = -1 / -3 

I I 



Query: 48 XXXXXXXXXXXXXPV 4 

PV 

Sbjct: 64 QRAQQNRWDAWPV 20 

Score =22.6 bits (43), Expect (3) = 5e-21 
Identities = 3/9 (33%) 
Frame = -2 / -1 
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Query: 29 XXXXXXYQS 3 
YQS 

Sbjct: 45 VWSMRWYQS 19 



Score = 21.7 bits (41), Expect (2) = 7e-09 
Identities = 2/7 (28%) 
Frame = -3 / -2 



Query: 25 XXXXXTS 5 
TS 

Sbjct: 41 GRCGGTS 21 

CPU time: 0.10 user sees. 0.03 sys. sees 0.13 total sees. 

Lambda K H 

0.318 0.135 0.401 



Matrix: BLOSUM62 

Number of Hits to DB: 662 

Number of Sequences: 0 

Number of extensions: 21 

Number of successful extensions: 12 

Number of sequences better than 10.0: 1 

length of database: 1,720,680,694 

effective HSP length: 54 

effective length of database: 1,410,958,186 
effective search space used: 63493118370 
frameshift window, decay const: 50, 0.1 
T: 13 
A: 40 

XI: 16 (7.3 bits) 
X2: 109 (49.9 bits) 
SI: 41 (21.7 bits) 
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J> NCB 



I 



Blast 2 Sequences results 

PubMed Entrez BLAST OMIM Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION TBLASTX 2.2.2 [Dec-14-2001] 



Match:!? J Mis match:! ? — i ga p open:!ll — ■ gap extension: !^ — 

x_dropoff:|5Dexpect:!2gjgg^wordsize:CF^ 

S'^'*"''"" lcl|seq_l 

Sequence gi mnl4al 1 .rl Beddington mouse embryonic region Mus 

2 1668613 musculus cDNA clone IMAGE:537884 5*. 



Length 300 



Length 298 



(1 .. 
300) 

(1 .. 
298) 




NOTE:The statistics (bitscore and expect value) is calculated based on the size of nr database 

Score = 220 bits (474), Expect = 4e-56 
Identities = 91/91 (100%) 
Frame = +3 / +3 



Query: 3 DLCRESDLRHAMANCFEALIGAWLEGSLEEAKQLFGRLLFNDPDLREVWLNYPLHPLQL 182 ^ 

DLCRESDLRHAMANCFEALIGAVYLEGSLEEAKQLFGRLLFl^PDLREVWLNYPLHPLQL / < 



Sbjct: 24 DLCRESDLRHAMANCFEALIGAVYLEGSLEEAKQLFGRLLFNDPDLREVWLNYPLHPLQL 203; 



Query: 183 QEPNTDRQLIETSPVLQKLTEFEEAIGVIFT 275 

QEPNTDRQLIETSPVLQKLTEFEEAIGVIFT 
Sbjct: 204 QEPNTDRQLIETSPVLQKLTEFEEAIGVIFT 296 



Score = 202 bits (435), Expect = 9e-51 
Identities = 89/92 (96%) 
Fra me = -3 / -1 



Query: 277 *VKITPIASSNSVSFCRTGEVSISCRSVFGSCSWSGWRG*LSQTSRRSGSLKSKRPNNCL 98 
*VKITPIASSNSVSFCRTGEVSISCRSVFGSCSWSGWRG*LSQTSRRSGSLKSKRPNNCL 
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Sbjct: 298 *VKITPIASSNSVSFCRTGEVSISCRSVFGSCSWSGWRG*LSQTSRRSGSLKSKRPNNCL 119 

Query: 97 ASSRLPSK*TAPINASKQLAIACRRSDSLQRS 2 

ASSRLPSK*TAPI ASKQLAIA RRSDSL RS 
Sbjct: 118 ASSRLPSK*TAPIKASKQLAIAWRRSDSLHRS 23 

Score = 157 bits (337), Expect = 3e-37 
Identities = 69/91 (75%) 
Frame = +2 / +2 

' » 



Query: 5 PL*RIGPSTCNGQLF*SVNRSCLLGGKPGGSQAVIWTLAL**SGPARSLAQLSSPPTP'rT 184 

P+*RI C+GQLF*S++RSCLLGG+P GSQ VIWTL L**SGP+RSLAQLSSPPTPTT 
Sbjct: 26 PM*RIRSPPCDGQLF*SLDRSCLLGGEPRGSQTVIWTLTL**SGPSRSLAQLSSPPTPTT 205 



Query: 185 RAKY*STTY*NFSSSTKTY*V*RSNWSNFYS 277 

RAKY S TY*NF SST+TY V*RS+WS+ +S 
Sbjct: 206 RAKYGSATY*NFPSSTETYGV*RSDWSDLHS 298 

Score = 149 bits (321), Expect = 5e-35 
Identities = 68/89 (76%) 
Frame = -2 / -3 



Query: 275 SKNYSNCFFKLSKFL*NWRSFNKLSISIWLL*LEWVERIIEPDFAQVRIIKEQASK*LLG 96 

S+++SN FFKL KFL*NW SFNKL I IWLL*LEWVERIIEPDF++VRIIKE ASK*L G 
Sbjct: 296 SEDHSNRFFKLRJCFL*NWGSFNKLPIRIWLL*LEWVERIIEPDFSKVRIIKE*ASK*LFG 117 



Query: 95 FLQASLQVNSSY*RFKTIGHCMSKVRFST 9 

FL A LQVNSSY FKTIGH M+++ FST 
Sbjct: 116 FL*APLQVNSSYQGFKTIGHRMAEI*FST 30 

Score = 149 bits (320), Expect = 7e-35 
Identities = 69/88 (78%) 
Frame = +1 / +1 

1 ' 



Query: 4 TFVENRTFDMQWPIVLKR**ELFTWREAWRKPSSYLDACSLMIRTCAKSGSIILSTHSNY 183 

T+VEN+ M+WPIVLK **ELFTWR A RKP+SYLDA SLMIRT KSGSIILSTHSNY 
Sbjct: 25 TYVENQISAMRWPIVLKP**ELFTWRGA*RKPNSYLDAYSLMIRTFEKSGSIILSTHSNY 204 



Query: 184 KSQILIDNLLKLLQFYKNLLSLKKQLE* 267 

KSQI I NLLKL QFY+NL SLKK+LE* 
Sbjct: 205 KSQIRIGNLLKLPQFYRNLRSLKKRLE* 288 

Score = 146 bits (313), Expect = 6e-34 
Identities = 69/92 (75%) 
Frame = -1 / -2 

I 

I 



Query: 276 E*KLLQLLLQTQ*VFVELEKFQ*VVDQYIAIiWGVGGEDN*ARLRAGPDH*RASVQITAW 97 
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SbjCt: 



297 



E*+ LQ LLQT *V VEL KFQ*V D YIiALWGVGGEDN*ARL GPDH*R SVQIT W 
E*RSLQSLLQTP*VSVELGKFQ*VADPYLALVVGVGGEDN*ARLLEGPDH*RVSVQITVW 118 



Query: 



96 



LPPGFPPSKQLLLTLQNNWPLHVEGPILYKGQ 1 
LP G PPSKQLL LQNNWP H ILY G+ 
LPLGSPPSKQLLSRLQNNWPSHGGDLILYIGR 22 



Sbjct: 



117 



CPU time: 



0.05 user sees. 



0 , 06 sys . sees 



0.11 total sees. 



Lambda 



K 



H 



0.318 0.135 0.401 



Matrix: BLOSUM62 

Number of Hits to DB: 638 

Number of Sequences: 0 

Number of extensions: 19 

Number of successful extensions: 6 

Number of sequences better than 10.0: 1 

length of database: 1,668,972,645 

effective HSP length: 54 

effective length of database: 1,368,557,577 
effective search space used: 61585090965 
frameshift window, decay const: 50, 0.1 

T: 13 
A: 40 

XI: 16 ( 7.3 bits) 
X2: 109 (49.9 bits) 
SI: 41 (21.7 bits) 
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Page 1 of 2 



NCBI 



Blast 2 Sequences results 

PubMed Entrez BLAST OMIM Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION TBLASTX 2.2.2 [Dec-14-2001J 



gap openJll -J gap extension: 



Matchrlr — J Mis match:!, ^ ^ 

x^poffiEexpect:!!"^ IfM 



S^^"^"^^ lcl|seq_l 



Length 300 (1 300) 



Sequence gi Human fibrinogen gamma chain/fibrinogen alpha eth 13134^^ " 

2 1117833 chain genes, intergenic region. ^ 13134) 



NOTE:The statistics (bitscore and expect value) is calculated based on the size of nr database 

Score = 38.6 bits (78), Expect = 0.16 
Identities = 13/22 (59%) 
Frame = +1 / +2 



Query: 43 Y*L*SPCCTLEKKTYFSCLSKT 108 

Y L*SPCCTL+ +TY C+++T 
Sbjct: 11663 YQL*SPCCTLDSRTYLYCITQT 11728 

Score =35.9 bits (72), Expect =1.1 
Identities = 15/34 (44%) 
Frame = +2 / +3 



n 



Query: 35 YKIINYSHHAVPWKRKPTFLA*VKLLPFSRTGGP 136 

+ IINYSHHA W + ++A*+KL T P t 

Sbjct: 11655 HNIINYSHHAAHWTPELIYIA*LKLCAL*PTSPP 11756 



CPU time: 

Lambda 
0.318 



K 



0.06 user sees. 
H 



0.03 sys. sees 



0.135 



0.401 



0.09 total sees. 



Matrix: BLOSUM62 
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Number of Hits to DB: 11,835 

Number of Sequences: 0 

Number of extensions: 205 

Number of successful extensions: 10 

Number of sequences better than 10.0: 1 

length of database: 1,668,972,645 

effective HSP length: 54 

effective length of database: 1,368,557,577 
effective search space used: 61585090965 
frameshift window, decay const: 50, 0.1 
T: 13 
A: 40 

XI: 16 ( 7.3 bits) 
X2: 109 (49.9 bits) 
SI: 41 (21.7 bits) 
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Exhi| 
SEQ IDT899 
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Blast 2 Sequences results 

PubMed Entrez BLAST OMIM Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION TBLASTX 2.2.2 [Dec-14-20011 

Match:l9 \ MismatchJ^^ — I gap open:lll --i gap extension: V. 



t_dropoffillDexpect:l!Ogwordsize:OFiltgrP Sill 
S'^"*'''*^*^ lcl|seq_l 

Sequence gi Human XP 1 PO ribosomal protein S3 (rpS3) mRNA, 

2 555940 complete cds. 

S 2 



Length 209 



Length 754 



(1 

754) 



□L 




NOTE:The statistics (bitscore and expect value) is calculated based on the size of nr database 

Score = 108 bits (231), Expect = 5e-23 
Identities = 44/69 (63%) 
Frame = -3 / -3 

1 



JL 



Query: 207 RRTCYAVGTGWGMAEGSGFPPFCSEMGWGSISSMGSTMMPWDPTGNRGPKKPLPDHVSI 28 

RR CYAVGTGWGMA GSGFPPFCSEMGWGSISS+GSTM+ W G GP P+ S+ f "^j 
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Sbjct: 743 RRPCYAVGTGWGMAGGSGFPPFCSEMGVVGSISSLGSTMLTWSGRGFLGPILPVGSQGSM 564 



Query: 27 WRLALRSP 1 

+ L +P 
Sbjct: 563 IFTLMPSTP 537 

Score = 101 bits (215) , Expect = 7e-21 
Identities = 45/69 (65%) 
Frame = +2 / +1 



Query: 2 GLLRANLTTMLTWSGRGFLGPLLPVGSQGIIVEPIDEILPTTPISEQKGGKPEPSAMPQP 181 

G+L + ML W G +GP P+ IVEP DEILPTTPISEQKGGKPEP AMPQP 

Sbjct: 538 GVLGIKVKIMLPWDPTGKIGPKKPLPDHVSIVEPKDEILPTTPISEQKGGKPEPPAMPQP 717 



Query: 182 VPTA*QVLL 208 

VPTA*Q LL 
Sbjct: 718 VPTA*QGLL 744 



Score = 100 bits (213), Expect = le-20 
Identities = 36/39 (92%) 
Frame = -1 / -1 



Query: 209 QGEPVMLWELAGAWQKALASHPSVLRWGWWAVSHLWVPQ 93 

QG+PVMLW LAGAWQ ALASHPSVLRWGWWAVSHLWVPQ P 
Sbjct: 745 QGDPVMLWGLAGAWQAALASHPSVLRWGWWAVSHLWVPQ 629 

Score = 100 bits (213), Expect = le-20 
Identities = 36/39 (92%) 

Frame - -2 / -2 

I I 

I I 



Query: 208 KENLLCCGNWLGHGRRLWLPTLLF*DGGGGQYLIYGFHN 92 

KE LLCCG+WLGHGRRLWLPTLLF*DGGGGQYLI+GFHN 
Sbjct: 744 KETLLCCGDWLGHGRRLWLPTLLF*DGGGGQYLIFGFHN 628 



Score =99.1 bits (210), Expect = 4e-20 
Identities = 37/42 (88%) 
Frame = +1 / +3 



Query: 82 PGHHCGTHR*DTAHHPHLRTEGWEARAFCHAPASSHSITGSP 207 

P HCGT R*DTAHHPHLRTEGWEARA CHAPAS HSITGSP 
Sbjct: 618 PREHCGTQR*DTAHHPHLRTEGWEARAACHAPASPHSITGSP 743 



Score = 93.2 bits (197), Expect = 2e-18 
Identities = 35/40 (87%) 
Frame = +3 / +2 
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Query: 90 SLWNP*MRYCPPPPSQNRRVGSQSLLPCPSQFPQHNRFSL 209 

+LWNP MRYCPPPPSQNRRVGSQS LPCPSQ PQHNR SL , 0 

Sbjct: 626 ALWNPKMRYCPPPPSQNRRVGSQSRLPCPSQSPQHNRVSL 745 

Score =56.5 bits (117), Expect = 2e-07 
Identities = 20/23 (86%) 
Frame = -3 / +1 



Query: 93 MMPWDPTGNRGPKKPLPDHVSIV 25 

M+PWDPTG GPKKPLPDHVSIV 
Sbjct: 565 MLPWDPTGKIGPKKPLPDHVSIV 633 

Score 55.6 bits (115), Expect = 5e-07 
Identities = 20/25 (80%) 
Frame = +2 / -3 

I I 

I I 



Query: 23 TTMLTWSGRGFLGPLLPVGSQGI I V 97 

+TMLTWSGRGFLGP+LPVGSQG ++ 
Sbjct: 635 STMLTWSGRGFLGPILPVGSQGSMI 561 

Score = 55.1 bits (114), Expect = 6e-07 
Identities = 20/24 (83%) 
Frame = +1 / - 2 

I I 

I 1 



Query: 22 YHNAHWRQGLLRAPVTSWVPGHH 93 

+HNAHVVRQGLLRA +TSWVPG H 
Sbjct: 636 FHNAHWRQGLLRANLTSWVPGQH 565 

Score =51.9 bits (107), Expect = 6e-06 
Identities = 21/27 (77%) 
Frame = -2 / +3 

C 

T 



Query: 103 GFHNDALGPNW*QGP*EAPA*PREHCG 23 

G + ALGPNW* P*EAPA*PREHCG 
Sbjct: 555 GEDHAALGPNW*DWP*EAPA*PREHCG 635 

Score = 51.5 bits (106), Expect = 8e-06 
Identities = 22/29 (75%) 
Frame = +3 / -1 

I t 

I I 



Query: 21 LPQCSRGQAGAS*GPCYQLGPRASLWNP* 107 

+PQCSRGQAGAS*G YQLGPRA+ +P* 
Sbjct: 637 VPQCSRGQAGAS*GQSYQLGPRAA*SSP* 551 

Score = 49.6 bits (102), Expect = 3e-05 
Identities = 20/22 (90%) 
Frame = -1 / +2 

I 1 
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Query: 89 CPGTQLVTGALRSPCLTT*ALW 24 

CPGTQLV ALRSPCLTT*ALW 
Sbjct: 569 CPGTQLVRLALRSPCLTT*ALW 634 

CPU time: 0.10 user sees. 0.01 sys. sees 0.11 total sees. 

Lambda K H 

0.318 0.135 0.401 



Matrix: BIjOSUM62 

Number of Hits to DB: 1171 

Number of Sequences: 0 

Number of extensions: 38 

Number of successful extensions: 20 

Number of sequences better than 10.0: 1 

length of database: 1,668,972,645 

effective HSP length: 51 

effective length of database: 1,261,711,380 
effective search space used: 22710804840 
frameshift window, decay const: 50, 0.1 
T: 13 
A: 40 

XI: 16 ( 7.3 bits) 
X2: 109 (49.9 bits) 
SI: 41 (21.7 bits) 
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SEQ ID 2007 
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NCB 



Blast 2 S quences results 

PubMed Entrez BLAST OMIM Taxonomy Structure 

BLAST 2 SEQUENCES RESULTS VERSION TBLASTX 2,2,2 IDec-14.2001] 



Match:I?_J Mismatch:!?^ 



x_dropoff: ^"^ J expect 



^.|10.000( 



J gap openJlj--. gap extension: U 



wordsize:!? ^ Filter 



Sequence 1 lcl|seq_l Length 300 (1 300) 
Sequence 2 gi 397648 Mus musculus B6D2F1 clone 2C#4 mRNA. Length 962 (1 962) 
I I 2 




NOTE:The statistics (bitscore and expect value) is calculated based on the size of nr database 

Score = 154 bits (330), Expect = le-35 
Identities = 58/70 (82%) 

Frame = +2 / +1 

I -1 
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Query: 89 ERDKNQSSSKRKVEQFWRFYSHMVRPGDLTGHSDFHLFKEGIKPMWEDDANKNGGKWIIR 268 

E++ Q + VEQFW+FYSHMVRPGDLTGHSDFHLFKEGIKPMWEDD NKNGGKWIIR ^ ( 

Sbjct: 268 EQNIKQIGTFPSVEQFWKFYSHMVRPGDLTGHSDFHLFKEGIKPMWEDDPNKNGGKWIIR 447 

Query: 269 LRKGLASRCW 298 

LRKGLASRCW 
Sbjct: 448 LRKGLASRCW 477 

Score ^ 133 bits (286), Expect = 3e-30 
Identities = 54/59 (91%) 
Frame = -1 / -1 

I 

I I 



Query: 300 SQQREAKPFRSRIIHLPPFLFASSSHMGLIPSLKRWKSLWPVRSPGRTMWL*NLQNCST 124 

SQQREAKPFRSR+IHLPPFLF SSSH+GLIPSLKR KSLWPVRSPGRTMWL*N QNCST 
Sbjct: 479 SQQREAKPFRSRMIHLPPFLFGSSSHIGLIPSLKR*KSLWPVRSPGRTMWL*NFQNCST 303 

Score = 112 bits (240), Expect = 7e-24 
Identities = 47/58 (81%) 
Frame = -2 / -2 

I I 

I I 



Query: 299 PSNGRPSPSAAE*STCHHFYLHHPPTWV*FLL*RDGSHCGLSGPQDVPCGYKTSRTAP 126 

PS+G+ SPS E*STC HFYL HPPT V* LL*RD SHCGLSGPQDVPCG KTSRTAP 
Sbjct: 478 PSSGKLSPSGVE*STCPHFYLDHPPT*V*SLL*RDESHCGLSGPQDVPCGCKTSRTAP 305 

Score = 110 bits (234), Expect ^ 5e-23 
Identities = 45/58 (77%) 
Frame = -3 / -3 

I t 

I I 



Query: 298 PATGGQALPQPNNPLATIFICIILPHGFNSFFEEMEVTVACQVPRTYHVAIKPPELLH 125 

PA G ALP+ N+PLA IFI IILPH FN FFEEM+VTVACQVPRTYHVA+K PELLH 
Sbjct: 477 PAAGS*ALPESNDPLAPIFIWIILPHRFNPFFEEMKVTVACQVPRTYHVAVKLPELLH 304 

Score = 109 bits (232), Expect = 9e-23 
Identities = 47/58 (81%) 
Frame = +1 / +3 

' ! 

I I 



Query: 127 GAVLEVL*PHGTSWGPDRPQ*LPSLQRRN*THVGG*CK*KWWQVDYSAAEGLGLPLLG 300 

GAVLEVL PHGTSWGPDRPQ*L SLQRR+*T+VGG* K*KW QVD+S EGL PLLG 
Sbjct: 306 GAVLEVLQPHGTSWGPDRPQ*LSSLQRRD*TYVGG*SK*KWGQVDHSTPEGLSFPLLG 479 

Score = 102 bits (217), Expect(2) = 3e-33 
Identities = 48/74 (64%) 
Frame = +3 / +2 

I I 



Query: 78 RKKRNETRIRAVARERWSSSGGFIATWYVLGT*QATVTSISSKKELNPCGRMMQIKMVAS 257 
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R+ + T R WSSSG F ATWYVLGT*QATVT ISSKK LN CGRM+QIKM AS 

Sbjct: 257 RRAMSRTSSRLAPFPLWSSSGSFTATVrirVIX3T*QATVTFISSKKGLNLCGR^^ 436 



Query: 258 GLFGCGRAWPPVAG 299 

G F GRA P AG 
Sbjct: 437 GSFDSGRA*LPAAG 478 



Score = 74.4 bits (156), Expect (2) = 6e-14 
Identities = 26/30 (86%) 
Frame = -2 / -3 



Query: 92 VPFFPSHHLSVCCFLHSDHGPHCHHLSKRR 3 

V FFPSHHLSVCCFLHSDHG HCHHL +RR 
Sbjct: 135 VLFFPSHHLSVCCFLHSDHGLHCHHLLRRR 46 



Score = 74.4 bits (156), Expect(2) = le-12 
Identities = 29/29 (100%) 
Fra me = +2 / +3 
1 



I 1 



Query: 2 FDALKDDDSGDHDQNEENSTQKDGEKEKT 88 ^ 
FDALKDDDSGDHDQNEENSTQKDGEKEKT ^ I 

Sbjct: 45 FDALKDDDSGDHDQNEENSTQKDGEKEKT 131 



Score = 70.3 bits (147), Expect(2) = 7e-12 
Identities = 29/30 (96%) 

Frame = -1 / -2 

I I. 



Query: 90 SVFSFSPSFCVLFSSF*SWSPLSSSFKASN 1 

+VFSFSPSFCVLFSSF*SWSPLSSSFKASN 
Sbjct: 133 TVFSFSPSFCVLFSSF*SWSPLSSSFKASN 44 



Score =62.9 bits (131), Expect (2) = 3e-33 
Identities = 27/31 (87%) 
F rame = +1 / +2 

I 

I I 



Query: 1 VRRFER**QWGP*SE*RKQHTERW*EGKNGT 93 

VRR +R**QW P*SE*RKQHTERW*EGKN T 
Sbjct: 44 VRRLKR**QWRP*SE*RKQHTERW*EGKNST 136 



Score = 58.3 bits (121), Expect = 2e-07 
Identities = 26/41 (63%) 
F rame = + 3 / +1 

I I 



Query: 3 STL*KMMTVGTMIRMKKTAHRKMVRRKKRNETRIRAVARER 125 

ST *KMMTV TMIRMKKTAHRKMVRRKK+ + ++ + + 
Sbjct: 46 STP*KMMTVETMIRMKKTAHRKMVRRKKQYRDKSQSSGKRK 168 
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Score = 57.4 bits (119), Expect = 4e-07 
Identities - 26/34 (76%) 
Frame = -3 / -1 

< I 

' ' 



Query: 103 ILVSFRFFLLTIFLCAVFFILIMVPTVIIFQSVE 2 

+L + FFLLTIFLCAVFFILIMV TVIIF VE 
Sbjct: 146 LLSRYCFFLLTIFLCAVFFILIMVSTVIIF*GVE 45 

Score = 25.8 bits (50), Expect (2) - 6e-14 
Identities = 9/15 (60%) 
Frame = -2 / -2 



Query: 125 PFSCYCSDSCLVPFF 81 

P SC+CS SCL F 
Sbjct: 169 PSSCHCSGSCLGTVF 125 

Score =23.1 bits (44), Expect (2) = 7e-12 
Identities = 11/15 (73%) 
Frame = -3 / -3 



Query: 124 LSLATALILVSFRFF 80 

L LATAL LVS FF 
Sbjct: 168 LPLATALALVSVLFF 124 

Score =22.6 bits (43), Expect (2) = le-12 
Identities = 10/15 (66%) 
Frame = +3 / +2 

f — I 

□ 



Query: 81 KKRNETRIRAVARER 125 

K ETR RAVAR R 
Sbjct: 125 KNSTETRARAVARGR 169 

CPU time: 0.09 user sees. 0.03 sys. sees 0.12 total sees. 

Lambda K H 

0.318 0.135 0.401 



Matrix: BLOSUM62 

Number of Hits to DB: 1323 

Number of Sequences: 0 

Number of extensions: 37 

Number of successful extensions: 16 

Number of sequences better than 10.0: 1 

length of database: 1,668,972,645 

effective HSP length: 54 

effective length of database: 1,368,557,577 
effective search space used: 61585090965 
frameshift window, decay const: 50, 0.1 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi? 



3/18/2002 



Blast Result Page 5 of 5 



T: 13 
A: 40 

XI: 16 ( 7.3 bits) 
X2: 109 (49.9 bits) 
SI: 41 (21.7 bits) 



http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi? 



3/18/2002 



